Abstract

In this work we define a new algebra of operators as a substitute
for fuzzy logic. Its primary purpose is the construction of binary
discriminators for phonemes based on spectral content. It is optimized
for the design of non-parametric computational circuits and makes use
of four operations: min, max, the difference, and generalized additively
homogeneous means.

1 Introduction 

Probability, statistics, and in particular Bayesian statistics provide a
disciplined foundation for understanding uncertain phenomena. Fuzzy logic is
a generalization of Boolean logic that purports to provide another way to
describe and reason with uncertainties [1], [2]. A very useful feature of
fuzzy logic is that it can be implemented in analog circuits. It serves as a
basis for fuzzy set theory and since its inception has found thousands of
applications ranging from modelling and prediction to industrial controllers.
There is really not a single fuzzy logic, but rather many variations. Most of
them, however, use min and max operators [3]. Recently, a new way to compute
voltage-mode min and max operations using memristors [4] has been proposed in
[5]. Unlike previous implementations using bipolar junction transistors [6],
[7], and CMOS transistors [8], [9], the proposed circuit is passive. This has
prompted research inquiries about the computational power of circuits whose
elements are mostly min and max operators, extending classical research on
sorting networks [10], which use min and max exclusively.

First steps were taken in [11], [12], [13], [14], [15], where the authors find
pattern recognition circuits using fuzzy logic operators. Their application
domain is speech recognition. In this note we lay out our arguments for why we
believe that fuzzy logic may not be the best choice in this context. We
propose an alternative that we call KS-algebra. The key principles guiding
our proposal are:

• simplicity, keeping the family of operations small to allow for more
efficient optimization using evolutionary algorithms, which are commonly
used to search for optima in discrete spaces [16], [17],

• richness, that would allow expression of desirable properties, such as 
being able to capture the concept of formant [18, page 12], or provide 
invariance under volume adjustment, 

• nonlinearity, by which we mean using nonlinear operators, such as min
and max, as well as the absence of continuous parameters, which are
currently hard to implement in analog electrical circuits.

This is the first paper in a series in which we will investigate phoneme
discrimination circuits using this algebra. Let us note that the algebra can
also be used to design circuits for feature detection (e.g. lines, edges)
in vision applications, but the speed of computation when simulated on digital
computers is not competitive.

2 Basic properties of human speech processing

In order to design an efficient speech recognition framework it is helpful to
understand the underlying mechanisms that allow humans to process audio
signals. If the framework takes these mechanisms into account, it is more
likely to exhibit performance closer to that of humans. This is demonstrated
by the improved performance of mel-based cepstral coefficients, PLP [19] or
RASTA [20].

From a physical point of view, sound is simply an oscillation of air pressure.
It is perceived by humans via a complex series of transformations. First, the
air pressure oscillation is conducted (and certain frequencies partially
amplified) via the ear canal to the bones of the middle ear. They transfer
the oscillations into vibrations of the basilar membrane, which in turn
stimulates the inner hair cells. By release of neurotransmitters, mechanical
energy is converted to electrical signals that are further processed by the
central nervous system, in which neurons communicate primarily by firing
pulses. An increase of stimulation in the inner hair cells translates to an
increase of the firing rate of a neuron. There is, however, a limit to a
neuron's firing rate due to its refractory period.

The crucial point is that, from a computational point of view, the organ of
Corti performs a Fourier transform. That is, each section of the basilar
membrane is tuned to a narrow frequency band and the neural response is
commensurate with the energy in that band. It is therefore natural to
construct phonemic classifiers based on spectral data.

3 Nonlinear means 

Our primary motivation is to discriminate between phonemes. It is well
known that the location of formants is an important characteristic of vowels.
Thus we would like to find ways other than LPC to quantify the "peakness" of
formants in the spectral envelope. One way to do that is to start with the
arithmetic mean

$$M(x_1, \ldots, x_n) = \frac{x_1 + x_2 + \cdots + x_n}{n}$$

and look for its generalizations. An obvious one is to use weighted linear
combinations

$$L(x_1, \ldots, x_n) = a_1 x_1 + \cdots + a_n x_n,$$

which leads to approaches like logistic regression or support vector machines.

We wish to investigate nonlinear functions. In nonlinear algebras, useful
simplifying laws such as commutativity or distributivity of operations need
not hold. In order to obtain a reasonably small set of functions in a
nonlinear algebra we will work with functions derived from symmetric
operators. These are functions $f$ invariant with respect to transpositions
of adjacent arguments,

$$f(x_1, \ldots, x_i, x_{i+1}, \ldots) = f(x_1, \ldots, x_{i+1}, x_i, \ldots),$$

and thus also with respect to any permutation of their arguments. In fact,
they are uniquely defined once one specifies them on the whole cone
$x_1 \le x_2 \le \cdots \le x_n$.
Let us recall the classically known generalizations of the arithmetic mean.
The quadratic mean is given by the formula

$$M_2(x_1, x_2, \ldots, x_n) = \sqrt{\frac{x_1^2 + x_2^2 + \cdots + x_n^2}{n}},$$

the geometric mean by

$$G(x_1, x_2, \ldots, x_n) = \sqrt[n]{x_1 \cdot x_2 \cdots x_n},$$

and the harmonic mean by

$$H(x_1, x_2, \ldots, x_n) = \left( \frac{x_1^{-1} + x_2^{-1} + \cdots + x_n^{-1}}{n} \right)^{-1}.$$


The following ordering between these means holds:

$$H(\mathbf{x}) \le G(\mathbf{x}) \le M(\mathbf{x}) \le M_2(\mathbf{x}). \tag{1}$$

One can define still more general means of which the arithmetic, quadratic
and harmonic means are special cases. Namely, for $\alpha \neq 0$ and
$\mathbf{x} > 0$ set

$$M_\alpha(x_1, \ldots, x_n) = \left( \frac{x_1^\alpha + \cdots + x_n^\alpha}{n} \right)^{1/\alpha}.$$

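One can show that in fact

$$\lim_{\alpha \to 0} M_\alpha(x_1, \ldots, x_n) = G(x_1, \ldots, x_n),$$

so that the geometric mean is also a member of the family.
Proof. By L'Hôpital's rule we have

$$\lim_{\alpha \to 0} \ln M_\alpha(\mathbf{x})
  = \lim_{\alpha \to 0} \frac{\ln \frac{x_1^\alpha + \cdots + x_n^\alpha}{n}}{\alpha}
  = \lim_{\alpha \to 0} \frac{x_1^\alpha \ln x_1 + \cdots + x_n^\alpha \ln x_n}{x_1^\alpha + \cdots + x_n^\alpha}
  = \frac{\ln x_1 + \cdots + \ln x_n}{n}
  = \ln \sqrt[n]{x_1 \cdots x_n}.$$
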


Inequalities (1) are just special cases of the following theorem:
Theorem 1. If $\alpha < \beta$ then for nonnegative $x_1, \ldots, x_n$ we have

$$M_\alpha(x_1, \ldots, x_n) \le M_\beta(x_1, \ldots, x_n).$$
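
The family of generalized means and the ordering stated in Theorem 1 can be checked numerically. The following is a minimal sketch, not code from the original work; the function name `power_mean` and the sample values are illustrative assumptions, and the inputs are assumed strictly positive so that negative exponents are defined.

```python
import math

def power_mean(xs, alpha):
    """Generalized mean M_alpha of strictly positive numbers xs (hypothetical helper)."""
    n = len(xs)
    if alpha == 0:
        # Limiting case alpha -> 0: the geometric mean.
        return math.exp(sum(math.log(x) for x in xs) / n)
    return (sum(x ** alpha for x in xs) / n) ** (1.0 / alpha)

xs = [1.0, 2.0, 4.0, 8.0]
# alpha = -1, 0, 1, 2 give the harmonic, geometric, arithmetic and quadratic means.
means = [power_mean(xs, a) for a in (-1, 0, 1, 2)]
print(means)
```

On this sample the four values come out in nondecreasing order, as the theorem predicts for increasing exponents.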

For $\alpha \neq 1$ the means $M_\alpha$ are not linear, meaning the equality

$$M_\alpha(\mathbf{x} + \mathbf{x}') = M_\alpha(\mathbf{x}) + M_\alpha(\mathbf{x}')$$

need not hold. However, they are multiplicatively homogeneous, meaning

$$M_\alpha(c\,\mathbf{x}) = c\,M_\alpha(\mathbf{x}) \qquad (c > 0).$$

Since sound data is usually provided on a logarithmic scale, one may desire
means that are additively homogeneous, so that

$$F(c \cdot (1, \ldots, 1) + \mathbf{x}) = c + F(\mathbf{x}).$$

A family of such means is given by the formula

$$A_\alpha(x_1, \ldots, x_n) = \ln\bigl(M_\alpha(\exp(x_1), \ldots, \exp(x_n))\bigr), \tag{2}$$

where in particular

$$A_0(x_1, \ldots, x_n) = \frac{x_1 + \cdots + x_n}{n}, \tag{3}$$

$$\lim_{\alpha \to -\infty} A_\alpha(x_1, \ldots, x_n) = \min(x_1, \ldots, x_n), \tag{4}$$

$$\lim_{\alpha \to \infty} A_\alpha(x_1, \ldots, x_n) = \max(x_1, \ldots, x_n). \tag{5}$$
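
Formula (2) can be evaluated without overflow by factoring out an extreme element before exponentiating. The sketch below is an illustration under that assumption (the name `additive_mean`, the pivoting trick and the sample spectrum are not from the original text); it lets one check additive homogeneity and the limiting cases (3) to (5) numerically.

```python
import math

def additive_mean(xs, alpha):
    """A_alpha(x) = ln M_alpha(exp(x_1), ..., exp(x_n)), computed stably."""
    n = len(xs)
    if alpha == 0:
        return sum(xs) / n  # equation (3): the arithmetic mean
    # Shift by an extreme element so that every exponent is <= 0.
    pivot = max(xs) if alpha > 0 else min(xs)
    total = sum(math.exp(alpha * (x - pivot)) for x in xs)
    return pivot + math.log(total / n) / alpha

spectrum = [-3.0, 0.5, 2.0]           # hypothetical log-scale values
shifted = [x + 7.0 for x in spectrum]  # the same data, 7 units louder
print(additive_mean(shifted, 2.0) - additive_mean(spectrum, 2.0))
print(additive_mean(spectrum, -500.0), additive_mean(spectrum, 500.0))
```

The first printed value should be close to 7, and the last two should approach the minimum and maximum of the sample.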
4 Quantiles



Minimum and maximum arise naturally as the limiting cases of the nonlinear
means studied in the previous section, since

$$\lim_{\alpha \to -\infty} M_\alpha(x_1, \ldots, x_n) = \min(x_1, \ldots, x_n),$$

$$\lim_{\alpha \to \infty} M_\alpha(x_1, \ldots, x_n) = \max(x_1, \ldots, x_n).$$

A simple but important observation is that min and max do not generate
new values. In particular, evaluating any expression in the variables
$x_1, \ldots, x_n$ that uses only compositions of min and max operators
results in one of $x_1, \ldots, x_n$. A natural class of symmetric operators
in the algebra generated by min and max are quantiles. In fact, under
continuity requirements they are the only symmetric n-ary operators in that
algebra.

From a computer science point of view it is natural to ask how to compute
quantiles using a given class of min and max gates. In this context one
may ask, for instance, how many gates one needs, or what the depth of the
resulting feed-forward circuit is. The answers differ depending on the kinds
of min and max operators one allows.

Suppose for instance that one allows min and max gates of arbitrary arity.
Obviously n-ary min and n-ary max are both representable by one gate. Any
other quantile can be represented by circuits of depth two. For instance, the
second largest element can be found by computing either of the following two
expressions:

$$\min\bigl(\max(x_2, \ldots, x_n),\ \max(x_1, x_3, \ldots, x_n),\ \max(x_1, x_2, x_4, \ldots),\ \ldots\bigr)$$

$$\max\bigl(\min(x_1, x_2),\ \min(x_1, x_3),\ \ldots,\ \min(x_i, x_j),\ \ldots\bigr)$$
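
The two depth-two expressions for the second largest element can be written out directly. This is a small sketch for checking them against sorting; the function names are illustrative, and at least two inputs are assumed.

```python
def second_largest_min_of_max(xs):
    """min over i of the max of all inputs except x_i."""
    n = len(xs)
    return min(max(xs[j] for j in range(n) if j != i) for i in range(n))

def second_largest_max_of_min(xs):
    """max over all pairs i < j of min(x_i, x_j)."""
    n = len(xs)
    return max(min(xs[i], xs[j]) for i in range(n) for j in range(i + 1, n))

xs = [3, 9, 1, 7, 5]
print(second_largest_min_of_max(xs), second_largest_max_of_min(xs))  # both give 7
```

The first form uses n max gates of arity n - 1 feeding one min gate; the second uses one binary min gate per pair feeding one max gate.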

The restriction to binary min and max gates is more delicate. Results
about the computation of quantiles can be deduced from classical studies
on sorting networks. Well known is the bitonic sorting network of Batcher,
which yields explicit circuits of depth $O(\log^2 n)$ for quantiles of n
variables. A theoretical improvement came in [21], which showed that sorting
networks with $O(n \log n)$ comparators and depth $O(\log n)$ exist, which is
asymptotically optimal. However, the implied constant is quite large and
research in this area is ongoing.

For values of n up to 8, optimal networks are known. For instance, a
sorting network with 8 inputs and optimal depth 6 is shown in Figure 1.
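
To make the connection between sorting networks and quantiles concrete, the following sketch generates the layers of Batcher's bitonic sorter for a power-of-two number of wires and evaluates them with binary min and max gates only. It illustrates the bitonic construction mentioned above and is not claimed to be the network drawn in Figure 1; the function names and the sample input are assumptions.

```python
def bitonic_layers(n):
    """Layers of Batcher's bitonic sorter for n a power of two.

    Each layer is a list of (lo, hi) wire pairs: after the comparator,
    wire lo carries the min and wire hi the max of the two inputs.
    """
    layers = []
    k = 2
    while k <= n:
        j = k // 2
        while j >= 1:
            layer = []
            for i in range(n):
                partner = i ^ j
                if partner > i:
                    # The sorting direction alternates between blocks of size k.
                    layer.append((i, partner) if (i & k) == 0 else (partner, i))
            layers.append(layer)
            j //= 2
        k *= 2
    return layers

def run_network(layers, values):
    """Evaluate the network using only binary min and max operations."""
    wires = list(values)
    for layer in layers:
        for lo, hi in layer:
            a, b = wires[lo], wires[hi]
            wires[lo], wires[hi] = min(a, b), max(a, b)
    return wires

layers = bitonic_layers(8)
spectrum = [5.0, 1.0, 7.0, 3.0, 8.0, 2.0, 6.0, 4.0]
ordered = run_network(layers, spectrum)
second_largest = ordered[-2]  # any quantile is read off a fixed output wire
print(len(layers), ordered, second_largest)
```

For 8 inputs this construction has 6 layers of 4 comparators each, so every quantile of 8 variables is available at depth 6 from binary gates.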

5 Fuzzy logic 

The mathematical functions introduced so far ($M_\alpha$, $A_\alpha$ and
quantiles) are essentially monotone: when the spectral data increase, they
increase as well. In order to gain contrast it is useful to have a
nonsymmetric operation of, say, two variables, "increasing" in the first and
"decreasing" in the second. A well-known framework that uses such functions
is fuzzy logic.

Figure 1: Sorting network with 8 inputs and depth 6. Inputs are supplied on
the left, outputs (sorted values) can be read out on the right. Each vertical
wire signifies reordering of inputs on the corresponding horizontal wires.

Fuzzy logic dates back to the works of Łukasiewicz and Post. In 1940,
H. Weyl proposed a fuzzy logic where propositions are assigned values in the
unit interval. He generalized ordinary Boolean logic operators as follows:

$$\begin{aligned}
a \text{ and } b &= \min(a, b) \\
a \text{ or } b &= \max(a, b) \\
\text{not } a &= 1 - a \\
a \text{ implies } b &= 1 - a + \min(a, b) \\
&= \min(1, 1 - a + b),
\end{aligned}$$

where a and b take values in the interval [0, 1]. Many other operators in fuzzy
logic have since been proposed. For instance, one can use t-norms or
t-conorms instead. Fuzzy logic gained prominence through the foundational
paper on fuzzy sets written by L. Zadeh [22]. In this paper he used fuzzy
logic as a means to represent vague linguistic and cognitive notions.
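
The operators on the unit interval listed above are simple enough to state as code. The sketch below uses exact rational arithmetic so that the two forms of the implication can be compared without rounding effects; the function names and the test grid are illustrative choices.

```python
from fractions import Fraction

def fuzzy_and(a, b):
    return min(a, b)

def fuzzy_or(a, b):
    return max(a, b)

def fuzzy_not(a):
    return 1 - a

def fuzzy_implies(a, b):
    return min(1, 1 - a + b)

# A small grid of truth values in [0, 1].
grid = [Fraction(k, 4) for k in range(5)]
same = all(fuzzy_implies(a, b) == 1 - a + min(a, b) for a in grid for b in grid)
print(same)
```

Restricted to the values 0 and 1, these functions reduce to the ordinary Boolean connectives.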
6 Eliminating negation

We note that using min and max in fuzzy logic has been shown to be the
only choice under natural assumptions [23]. Using negation is, however,
problematic for several reasons.

The first reason arises from implementation issues one encounters when
translating fuzzy logic expressions into electronic circuits. Both negation
and implication, such as the Łukasiewicz implication, need to be implemented
with active electronic circuits.

This issue can be partially addressed. Passive circuits whose inputs are
normalized to [0, 1] can compute with the negation
$\bar{a} = \text{not } a = 1 - a$, provided the negated inputs are prepared
beforehand, due to the identities

$$\overline{\min(x, y)} = \max(\bar{x}, \bar{y}),$$

$$\overline{\max(x, y)} = \min(\bar{x}, \bar{y}).$$

This idea is a special case of negation conversion for MIN-MAX-AVG circuits
described in [24] (see Figure 2).
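
The two identities allow every negation to be pushed down to the inputs of a min/max expression. The following sketch shows this rewriting on a hypothetical tuple encoding of expressions; it handles only min, max and not, and does not attempt to reproduce the treatment of AVG gates in the cited construction.

```python
from fractions import Fraction

def push_negations(expr, negated=False):
    """Rewrite a min/max/not expression so that negation applies only to inputs.

    Expressions are nested tuples: ("var", name), ("not", e),
    ("min", e1, e2) or ("max", e1, e2). Negated inputs become ("nvar", name).
    """
    op = expr[0]
    if op == "var":
        return ("nvar", expr[1]) if negated else expr
    if op == "not":
        return push_negations(expr[1], not negated)
    # De Morgan: a negated min becomes a max of negations, and vice versa.
    if negated:
        op = "max" if op == "min" else "min"
    return (op, push_negations(expr[1], negated), push_negations(expr[2], negated))

def evaluate(expr, env):
    op = expr[0]
    if op == "var":
        return env[expr[1]]
    if op == "nvar":  # complemented input, assumed prepared beforehand as 1 - x
        return 1 - env[expr[1]]
    if op == "not":
        return 1 - evaluate(expr[1], env)
    a, b = evaluate(expr[1], env), evaluate(expr[2], env)
    return min(a, b) if op == "min" else max(a, b)

expr = ("not", ("min", ("var", "x"), ("not", ("max", ("var", "y"), ("var", "z")))))
pushed = push_negations(expr)
env = {"x": Fraction(1, 4), "y": Fraction(1, 2), "z": Fraction(3, 4)}
print(pushed, evaluate(expr, env), evaluate(pushed, env))
```

After rewriting, the interior of the expression contains only min and max gates, which is what a passive implementation requires.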




Figure 2: How to convert a MIN-MAX-NEGATION-AVG circuit to a MIN-MAX-AVG
circuit (NOT gate removed, cross-link added).
The second reason lies in a conflict with our understanding of auditory
perception. Introducing negation implies providing lower and upper bounds on
spectral data. Establishing a lower bound may be reasonable, because it
should correspond to the smallest intensity of audible sound. However, an
upper bound should probably reflect the loudest sound a human ear can
sustain without damage, or possibly the intensity threshold at which a
harmonic causes the maximum possible firing rate in the auditory nerve. Both
of these quantities are very hard to measure and are probably quite variable.
It is therefore unclear why one should let such an upper bound affect
information processing in phoneme classification circuits.

A final disagreement is of a philosophical nature. Fuzzy logic traditionally
deals with uncertain notions primarily of a linguistic nature, such as "tall"
or "warm". The spectral content of sound is of course also uncertain, due to
the uncertainty principle of the Fourier transform. However, the uncertainty
is of a different kind, taking the form of a random variable [25, Chapter 10].

We thus propose to leave the realm of fuzzy logic and instead use the
difference operator (which has the value $x - y$ for a pair of real-valued
variables x and y). If this operator is applied to two additively homogeneous
expressions, such as quantiles or the generalized means $A_\alpha$ described
earlier, it turns volume-sensitive expressions into volume-invariant
expressions. This invariance is likely to be a desirable feature of
classifiers.
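
The volume invariance claimed above follows because a uniform shift c is added to both operands and cancels in the difference. The sketch below checks this numerically for one hypothetical feature, the difference between the maximum and the mean $A_1$; the feature, the function names and the sample spectra are assumptions chosen for illustration.

```python
import math

def log_mean_exp(xs, alpha):
    """Additively homogeneous mean A_alpha for alpha != 0, computed stably."""
    pivot = max(xs) if alpha > 0 else min(xs)
    total = sum(math.exp(alpha * (x - pivot)) for x in xs)
    return pivot + math.log(total / len(xs)) / alpha

def peak_contrast(spectrum):
    """Hypothetical feature: max minus a soft average, both additively homogeneous."""
    return max(spectrum) - log_mean_exp(spectrum, 1.0)

quiet = [10.0, 35.0, 20.0, 15.0]
loud = [x + 12.0 for x in quiet]  # same sound, 12 units louder on a log scale
print(peak_contrast(quiet), peak_contrast(loud))
```

Each operand on its own changes by 12 between the two inputs, while the difference stays the same up to rounding.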

We thus arrive at the definition of the KS-algebra. An element of the
algebra is any expression of spectral components of sound and the zero value,
using the operators

• the minimum $\min(x_1, \ldots, x_n)$,

• the maximum $\max(x_1, \ldots, x_n)$,

• the difference $x_1 - x_2$,

• the additively homogeneous means $A_\alpha$.

In view of (3), (4), (5), the addition of operators in the last category can
be seen as constructing a link between linear circuits and min-max circuits,
allowing for direct comparison of the performance of linear and nonlinear
circuits.

7 Binary classifiers arising from the KS-algebra

An interesting experiment is described in [26], [27]. A sentence was
processed by a filter with a frequency response approximating the inverse of
the spectral envelope of a vowel. In spite of this, the vowel was recognized
by speakers. It is thus not the absolute spectral content that determines
phonemes, but rather the relative one. This is also confirmed, for instance,
in [28].

In our context, a binary classifier will consist of two ingredients. The
first one is a function $f$ of the KS-algebra. The second ingredient
describes what one concludes given an evaluation of $f$ on spectral data $s$.
We can distinguish the following natural variants of classifiers.

• A Z-classifier is described by a function $f$ in the KS-algebra. It
decides that a phoneme is of the first class if $f(s) < 0$, and decides that
it is of the second class if $f(s) > 0$.

• A B-classifier consists of a pair $(f, c)$, where $f$ is in the KS-algebra
and $c$ is a real number. It decides that a phoneme is of the first class if
$f(s) < c$, and concludes that it is of the second class if $f(s) > c$.

• An A-classifier consists of a pair $(f, c)$, where $f$ is in the
KS-algebra and $c$ is a real number. It decides that a phoneme is of the
first class if $f(s) < c$, but makes no conclusion if this condition is not
met.

Among A-classifiers we will distinguish the subclass of A'^ classifiers that
are characterized by the condition $c < 0$.
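
The three decision rules can be summarized in a few lines. The sketch below is one possible reading, in which a return value of None means that no conclusion is drawn; the behaviour at exact equality is not specified in the text and is treated here as "no conclusion" by assumption. The expression `contrast` is a made-up element of the algebra used only to exercise the rules.

```python
def z_classify(f, s):
    """Z-classifier: class 1 if f(s) < 0, class 2 if f(s) > 0."""
    value = f(s)
    if value < 0:
        return 1
    if value > 0:
        return 2
    return None  # assumption: the boundary case f(s) == 0 is left undecided

def b_classify(f, c, s):
    """B-classifier: the same rule with a threshold c in place of 0."""
    return z_classify(lambda data: f(data) - c, s)

def a_classify(f, c, s):
    """A-classifier: class 1 if f(s) < c, otherwise no conclusion."""
    return 1 if f(s) < c else None

def contrast(s):
    # Hypothetical expression built from max, min and the difference.
    return max(s[0], s[1]) - min(s[2], s[3])

s = [1.0, 4.0, 6.0, 9.0]  # made-up spectral data; contrast(s) == -2.0
print(z_classify(contrast, s), b_classify(contrast, -3.0, s), a_classify(contrast, -3.0, s))
```

A Z-classifier is the special case of a B-classifier with threshold zero, which is why the second function is written in terms of the first.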

Let us remark that, unlike models with continuous parameters, the
classifiers are really described by their structure and support, the latter
being defined as follows.



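$$\begin{aligned}
\operatorname{supp}(0) &= \{\} \\
\operatorname{supp}(s_i) &= \{i\} \\
\operatorname{supp}(\min(f_1, f_2)) &= \operatorname{supp}(f_1) \cup \operatorname{supp}(f_2) \\
\operatorname{supp}(\max(f_1, f_2)) &= \operatorname{supp}(f_1) \cup \operatorname{supp}(f_2) \\
\operatorname{supp}(A_\alpha(f_1, \ldots, f_n)) &= \bigcup_i \operatorname{supp}(f_i)
\end{aligned}$$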
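
The recursive definition of the support translates directly into a traversal of an expression tree. The sketch below uses a hypothetical tuple encoding of expressions. The rule for the difference operator is not given in the definition above; taking the union of the operands' supports for it is an assumption made here by analogy with min and max.

```python
def supp(expr):
    """Support of an expression: the set of spectral indices it depends on.

    Hypothetical encoding: ("zero",), ("s", i), ("min", f1, f2),
    ("max", f1, f2), ("diff", f1, f2), ("mean", alpha, f1, ..., fn).
    """
    op = expr[0]
    if op == "zero":
        return frozenset()
    if op == "s":
        return frozenset({expr[1]})
    if op == "mean":
        operands = expr[2:]  # skip the exponent alpha
    elif op in ("min", "max", "diff"):
        # "diff" handled like min and max: an assumption, see the lead-in.
        operands = expr[1:]
    else:
        raise ValueError(f"unknown operator: {op}")
    result = frozenset()
    for operand in operands:
        result |= supp(operand)
    return result

expr = ("max", ("s", 1), ("mean", 2.0, ("s", 3), ("zero",), ("min", ("s", 1), ("s", 5))))
print(sorted(supp(expr)))  # [1, 3, 5]
```

Together with the shape of the tree, the support identifies which spectral components a classifier actually reads.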